Image processing apparatus, control method thereof, and storage medium that stores program thereof

ABSTRACT

Conventionally, there are problems in that object data extracted from an original document is not object data divided into object units a user desires, and that scan data cannot be handled in the object data units when reusing the scan data. To solve these problems, the image processing apparatus extracts each object included in the original document; stores object data of each extracted object with a keyword in a storage device; and visualizes each object data on a display unit. When making multiple selection of the visualized object data individually in accordance with a user instruction, the keywords associated with the plurality of object data selected are displayed. Besides, it is made possible to change the keywords associated with the plurality of object data selected.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to technology to improve the reusabilityof object data in an image processing apparatus capable of separatingobjects from an original document image.

2. Description of Related Art

Conventionally, there has been a technology as disclosed in JapanesePatent Laid-Open No. 2004-265384, which separates object data from scandata obtained by reading an original document, and applies an editingfunction to the object data. The technology performs the followingprocessing when separating objects from the scan data:

-   (1) Divides the scan data on an object by object basis;-   (2) Classifies each object data in the scan data to an item having    an attribute such as TEXT, GRAPHIC, and IMAGE, for example; and,-   (3) Converts the object data with the attribute of TEXT or GRAPHIC    to vector data by vectorization, and the object data with the    attribute of IMAGE to image data such as JPEG.

The object data separated as described above can be edited using anediting function.

In addition, there is a technology as disclosed in Japanese PatentLaid-Open No. 2006-277473, which adds attribute information data (akeyword) chosen by a user to the scan data afterward for improvingsearch ability.

SUMMARY OF THE INVENTION

The foregoing technology, however, has a problem in that because theobject data are automatically decided and generated in the imageprocessing apparatus, they are not always the object data in objectunits the user desires.

As for the technology disclosed in Japanese Patent Laid-Open No.2006-277473, even though it tries to reuse scan data stored in the imageprocessing apparatus by searching for the scan data using a keyword forsearch, the entire scan data becomes the reusable data. Accordingly,when reusing the scan data, a problem arises in that the scan datacannot be handled in the object data units.

As a result, to reuse the scan data in the object data units, it isnecessary to cut out only the object data from the scan data again, andthis prevents the improvement in the user convenience.

To solve the foregoing problems, the present invention has the followingfeatures.

In the first aspect of the present invention, there is provided an imageprocessing apparatus comprising: means for extracting each objectincluded in an original document, and for storing object data of eachextracted object with a keyword in a storage device; means forvisualizing each object data on a display unit; selecting means formaking multiple selections of the visualized object data individually inaccordance with a user instruction; means for displaying the keywordsassociated with the plurality of object data selected by the selectingmeans; and keyword assigning means for changing the keywords associatedwith the plurality of object data selected.

In the second aspect of the present invention, there is provided acontrol method for an image processing apparatus comprising: a step ofextracting each object included in an original document, and of storingobject data of each extracted object with a keyword in a storage device;a step of visualizing each object data stored in the storage device on adisplay unit; a selecting step of selecting more than one of thevisualized object data in accordance with a user instruction via a GUIon the display unit; a step of displaying on the display unit thekeywords associated with the object data selected; and a keywordassigning step of changing the keywords associated with the plurality ofobject data selected via the GUI on the display unit.

In the third aspect of the present invention, there is provided acomputer readable storage medium that stores a program for causing acomputer for controlling an image processing apparatus to execute: astep of extracting each object included in an original document, and ofstoring object data of each extracted object with a keyword in a storagedevice; a step of visualizing each object data stored in the storagedevice on a display unit; a selecting step of selecting more than one ofthe visualized object data in accordance with a user instruction via aGUI on the display unit; a step of displaying on the display unit thekeywords associated with the object data selected; and a keywordassigning step of changing the keywords associated with the plurality ofobject data selected via the GUI on the display unit.

In the present specification, the term “image processing apparatus”includes, in addition to dedicated image processing apparatuses andimage forming apparatuses, general-purpose information processingapparatuses capable of executing the processing in accordance with thepresent invention.

According to the present invention, it is possible to integrate ordivide the object data the user desires on a screen of the imageprocessing apparatus even if the scan data is divided automatically inthe object units the user does not desire when the image processingapparatus generates the object data. In addition, the user can add adesired keyword or change to a desired keyword on an object data unitbasis afterward. This makes it possible to facilitate the search on anobject unit basis and the reuse of the object data, thereby being ableto improve the user convenience.

Further features of the present invention will become apparent from thefollowing description of exemplary embodiments (with reference to theattached drawings).

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram showing a system configuration of a first embodimentin accordance with the present invention;

FIG. 2 is a block diagram showing a detailed configuration of an MFP ofthe first embodiment in accordance with the present invention;

FIG. 3 is a flowchart showing a processing procedure of creating objectdata and metadata in the first embodiment in accordance with the presentinvention;

FIG. 4 is a diagram for illustrating an object generating process in thefirst embodiment in accordance with the present invention;

FIG. 5 shows an example of the transition of an operation screen in thefirst embodiment in accordance with the present invention;

FIG. 6 shows an example of the transition of an operation screen in thefirst embodiment in accordance with the present invention;

FIG. 7 shows an example of the transition of an operation screen in thefirst embodiment in accordance with the present invention;

FIG. 8 is a diagram showing an operation flow in a second embodiment inaccordance with the present invention; and

FIG. 9 is an example of an input screen for changing a keyword in thesecond embodiment in accordance with the present invention.

DESCRIPTION OF THE EMBODIMENTS First Embodiment

The embodiments in accordance with the present invention will now bedescribed in detail with reference to the accompanying drawings.

FIG. 1 is a block diagram showing a configuration of an image processingsystem of the first embodiment in accordance with the present invention.

The image processing system is implemented in an environment in which anoffice 10 is connected to a network 104 such as the Internet.

An MFP (MultiFunction Peripheral) 100, a client PC 102 using the MFP100, a database 105, and a proxy server 103 are connected to a LAN 107constructed in the office 10. The MFP is a multifunction machine thatimplements a plurality of functions (copying function, printingfunction, transmitting function and the like). In addition, the LAN 107in the office 10 is connected to the network 104 via the proxy server103 of each office.

The configuration in FIG. 1 is only an example: there may be more thanone of each component, and a plurality of offices as shown in FIG. 1 canbe present and interconnected via the network 104. As for the network104, it is a so-called communications network, and it is sufficient thatit can perform data transmission and reception. Typically, the network104 is implemented in the form of a single network or a combination ofnetworks such as telephone circuits, a LAN, a WAN, dedicated digitallines, ATM circuits, frame relay lines, communications satellitecircuits, cable television circuits, and data broadcasting radiocircuits.

In addition, each client PC 102 has standard components (such as a CPU,RAM, ROM, hard disk, external storage device, network interface,display, keyboard, mouse and so on) to be mounted in a general-purposecomputer.

FIG. 2 is a block diagram showing a detailed configuration of the imageprocessing apparatus (MFP 100) of the first embodiment in accordancewith the present invention.

In FIG. 2, the reference numeral 201 designates an image reading sectionincluding an automatic document feeder (ADF). The image reading section201 irradiates an original document with light from a light source (notshown), first, and forms a reflected image of the original document on asolid-state image sensing device. Thus, it acquires the image readsignal output from the solid-state image sensing device as bitmap datawith a prescribed resolution (such as 600 dpi).

In addition, the MFP 100 has a copying function of printing an imagecorresponding to the image read signal on a recording medium with aprinting section 203. When making a single copy of the original documentimage, a data processing section 206 performs the image processing ofthe image read signal to generate a recording signal, and the printingsection 203 that receives the recording signal carries out printing on arecording medium. In contrast, when making a plurality of copies of theoriginal document image, the data processing section 206 causes astorage section 202 to temporarily retain the recording signal of onepage, and causes the printing section 203 to print it on recordingmediums with sequentially outputting the recording signal thereto.

The communication function, via a network I/F 205, makes it possible toconvert the bitmap data obtained from the image reading section 201 tothe object data through processing which will be described later, and totransmit it to the database 105 to be stored. In addition, it also makesit possible to receive the object data stored in the database 105 forreuse, or to convert the object data to an image file with a vector datafile format such as PDF and to transfer to the client PC 102. As astorage location of such data, besides the storage section 202 in theMFP 100 and database 105, a storage device of an external server(neither of which is shown) can be used. In this case, the data to bestored is transmitted to the external server connected through the proxyserver 103 and network 104 shown in FIG. 1 via the network I/F 205 whichis a communication means within the MFP 100, and is stored in thestorage device of the external server. The stored data can be searchedby a search function of each device.

The printing section 203 has a printing function of forming an image ona printing medium in accordance with the print data output from theclient PC 102. To utilize the printing function, the data processingsection 206 receives the print data output from the client PC 102 viathe network I/F 205. Then, after the data processing apparatus 206converts the print data to a recording signal in accordance with whichthe printing section 203 can print, it supplies the recording signal tothe printing section 203, and the printing section 203 forms an image ona recording medium.

Operator instructions to the MFP 100 are provided from an input section204 mounted on the MFP 100, and the operation corresponding to theinstructions is controlled by a control section (not shown) in the dataprocessing section 206. In addition, a display section (display unit)207 displays the state of the operation input and the image data duringthe processing.

The storage section (storage device) 202 provides a storage area forstoring object data acquired by the processing which will be describedlater, and an image processing buffer area used for various types ofimage processing by the data processing section.

[Object Data and Metadata Generating Processing]

Next, the process of separating the data obtained by scanning or thedata input as PDL data into individual objects, and of storing this datawill be described with reference to the flowchart of FIG. 3.

First, at step S301, when the input is a SCAN job, the image readingsection 201 of the MFP 100 scans the original document, and acquires a600 dpi, 8 bit image signal, for example. The data processing section206 performs preprocessing of the image signal, and stores it in thestorage section 202 as the image data (bitmap data) of one page. Whenthe input is described in a page description language (referred to as“PDL” from now on), the image data (vector data) of one page is storedin the PDL.

Next, when the input is scan data, a decision “SCAN” is made at stepS302, and the processing proceeds to step S303. At step S303, the dataprocessing section 206 carries out contents separating processing byperforming block selection (BS). More specifically, the data processingsection 206 divides the image data to be processed which the storagesection 202 stores as text/graphic portions and halftone image portions,first. It further divides the text/graphic portions into blocks, each ofwhich is treated as a paragraph, or into tables and figures consistingof lines, to make objects.

On the other hand, the data processing section 206 divides the half toneimage portions into objects (blocks) independent from each other asblocks, such as image portions and background portions, which consist ofblocks separated in rectangles. Although the present embodiment takesTEXT (text), GRAPHIC (thin line, figure and table) and IMAGE (image) asan example of the attributes, the types of attributes are not limitedthereto. It goes without saying that other types of attributes can beemployed in accordance with applications or purposes, and that it is notnecessary to use all the attributes. In the present embodiment, the term“block selection processing” refers to region dividing processing thatdivides the image into regions (blocks indicating the regions of theindividual objects) for respective attributes by analyzing the image.

Next, at step S304, vectorizing processing is performed on each objectseparated. The vectorizing processing uses an OCR to carry out thecharacter recognition of the objects decided as TEXT attribute,recognizes size, style and font of the characters, and converts thecharacters obtained by scanning the original document to visuallyfaithful font data. On the other hand, the objects such as tables andfigures consisting of lines with the GRAPHIC attribute are outlined. Inaddition, the image objects with the IMAGE attribute are subjected toJPEG compression individually as the image data.

The vectorizing processing of the various objects is executed inaccordance with the object information of the individual objects, andthe results thereof are stored as the object data with a formatconsisting of the metadata and image information which will be describedlater and is available in the MFP 100.

In contrast, when the input is PDL data, a decision “PDL” is made atstep S302 and the processing proceeds to step S305. At step S305, theinput PDL data is converted to object data on an object by object basis,which will be described later, and the object data are stored. In thiscourse, the attribute data (such as TEXT, GRAPHIC and IMAGE), themetadata and the layout information contained in the PDL data are alsoconverted and stored.

The term “metadata” mentioned here refers to the information on theobject such as characteristics (object size, font size and the like) andnames of the objects and to features representing relative relationshipswith other objects in the original document to be used by the editingprocessing which will be described later.

Next, at step S306, the data processing section 206 performs imageanalysis and document analysis for each object, thereby obtaininginformation indicating the contents of the object itself. In addition,it corrects the metadata generated at step S304 and step S305, and addsnew information. First, considering that the SCAN job and PDL jobgenerate different object information, the present embodiment preparesthree attributes of TEXT, GRAPHIC and IMAGE so that the same sort ofimage data undergo the same processing from this point forward. When theattribute of the object is TEXT, a summary of the document is made fromthe results of the document analysis and is added to the metadata. Whenthe object information is GRAPHIC, the image analysis is made to extractfeatures and more detailed information such as name, diagram and linedrawing, and the information is added to the metadata. When the objectinformation is IMAGE, the image analysis is carried out to extractfeatures and more detailed information such as name, portrait andlandscape, and the information is added to the metadata. However, theinformation added to the metadata here is only an example, and is notlimited to it. Any information can be added as long as it relates to theindividual objects.

Next, at step S307, the object data acquired in the processing up to theforegoing step S306 are stored in the storage section 202 or database105, whereby the object data are registered. The object data is notnecessarily registered. For example, if the storage section 202 hasalready stored the same or similar object data, the registering isomitted, or another decision processing can be made as to whether theobject data is to be registered or not.

[Object Data and Metadata]

Next, the object data and the metadata will be described with referenceto FIG. 4.

The reference numeral 403 of FIG. 4 designates an example of a datastructure of the object data in the present embodiment.

The object data 403 is composed of layout information 404 describing itsstructure, each object data 405 with an attribute of IMAGE, GRAPHIC orTEXT, and metadata 406 describing keyword information for searching eachdata 405 and the like. The metadata 406 included in the object data 403in this way can associate each object data 405 with the keywordregistered for each object data.

Each object data 405 of the IMAGE, GRAPHIC or TEXT and the metadata 406are acquired by performing the object data generating processingdescribed above on a paper original document 401 including the objects402 with the attributes of IMAGE, GRAPHIC and TEXT.

[Editing Function 1 of Object Data]

Next, the editing function of the object data in the present embodimentwill be described with reference to FIG. 5, FIG. 6, FIG. 7 and FIG. 9.

FIG. 5, FIG. 6 and FIG. 7 are diagrams each showing an example of theoperation screen in the present embodiment. In particular, the operationscreen based on a GUI (graphical user interface) is an example of theoperation screen of the operation section consisting of the inputsection 204 and display section 207 shown in FIG. 2.

The operation screen 501 is shown as a configuration example in whichthe input section 204 and the display section 207 are integrated. It isassumed in the example that the input section 204 and the displaysection 207 are composed of a touch screen and an LCD, respectively. Itgoes without saying that they can be independently composed of hard keys(a keyboard) or a pointing device, such as a mouse pointer, as the inputsection 204, and a CRT as the display section 207 and the like.

The operation screen 501 of FIG. 5 is an example of displaying adocument stored in the storage section 202 after performing the “objectdata and metadata generating processing” described above on the scandata obtained by scanning the paper original document 401 with the MFP100. As shown in FIG. 5, a state in which the scan data is divided intothe individual object data of IMAGE, TEXT and GRAPHIC can be visualizedand displayed as a thumbnail image 502. In addition, each object can beselected freely by touching the object in the thumbnail image 502 on thedisplay unit.

Here, if a user selects Image Object 1 in the thumbnail image 502 andpresses a keyword change button 503, the screen makes the transition anda keyword 506 corresponding to the selected object is displayed (see thelower portion of FIG. 5). The keyword displayed here is the keywordrecorded in the metadata 406 included in the foregoing object data 403.Although only a single keyword is displayed in the explanation here, ifa plurality of keywords is registered, it is also possible to displaythem.

In addition, if the user presses the keyword change button 507 when thekeyword 506 is displayed, the keyboard shown in FIG. 9 is displayed.This enables the user to change the keyword 506 freely by inputting akeyword the user wishes, thereby improving the search ability by thekeyword.

Although the keyword change for only one object is possible in theexample of FIG. 5, the user can select a plurality of objects as shownin FIG. 6 (capable of multiple selection). If the selection of theplurality of objects is allowed, and the user makes the multipleselection, a list of a plurality of keywords 601 corresponding to theplurality of objects selected is displayed (see the lower portion inFIG. 6). In this case, the display is made in such a manner that theuser can recognize which keyword is assigned to which object selected.Then, when the user presses a simultaneous change button 602, thekeyboard shown in FIG. 9 is displayed, and the user can input a desiredkeyword via the keyboard, which makes it possible to change collectivelythe plurality of keywords 601 to desired keywords. In this case, it isalso possible to utilize the keyword assigned before simultaneous changeto each object. The function can improve the user's convenience when heor she wants to change the keywords of the related objects to the samekeyword collectively. Although the present embodiment demonstrates onlyan example of selecting a plurality of objects on a page, using a nextpage button 505 enables simultaneous selection of a plurality of objectson a plurality of pages, thereby enabling simultaneous change of thekeywords.

Next, an example of integrating individual objects will be describedwith reference to FIG. 7.

When the user presses an object change button 504, selects a pluralityof objects 701 displayed in the thumbnail image 502, and presses anintegration button 702 of the objects, the selected objects areintegrated into a single object 703. Although a new keyword must beregistered in connection with the integration of the objects, it is alsopossible to choose an old keyword before the integration. In this case,keywords 704 having been registered before the integration are displayedso that the user can select one or more keywords, and the keywordsselected are assigned to the objects. In addition, although an exampleis explained of the user to select the keywords every time theintegration processing is made, this is not essential. For example, aconfiguration is also possible which makes a setting in advance on aninitial setting screen in such a manner as to choose all the oldkeywords of the objects to be integrated, and which automaticallyassigns them to the object after the integration as the initialkeywords.

To assign a brand-new keyword, the user can press a keyword changebutton 705, and register any desired keyword from the keyboard screen ofFIG. 9. Thus, even if the individual object data have not been dividedinto object units the user desires, the user can reintegrate into anydesired object afterward, which improves the reusability of the objectdata.

As for the object data 403, although it is stored in the storage section202 in the MFP 100 in the present embodiment, the database 105 on thenetwork can also be used as a destination for storage.

Second Embodiment

Next, a second embodiment in accordance with the present invention willbe described.

Configurations of the image processing system and MFP in the presentembodiment are the same as those of the image processing system (FIG. 1)and MFP (FIG. 2) described in the foregoing first embodiment. Inaddition, as for “object data and metadata generating processing” and“object data and metadata”, they are the same as the correspondingcontents described in the first embodiment. Accordingly, theirdescription is omitted here, and only the editing function of the objectdata will be described below.

[Editing Function 2 of Object Data]

The editing function of the object data in the present embodiment willbe described below with reference to FIG. 8 and FIG. 9.

FIG. 8 is a diagram showing an example of the operation screen in thepresent embodiment. In particular, the operation screen based on a GUIis an example of the operation screen of the operation sectionconsisting of the input section 204 and display section 207.

The operation screen 501 is shown as a configuration example of theoperation screen of the operating section in which the input section 204and the display section 207 are integrated. An example is assumed inwhich the input section 204 and the display section 207 are composed ofa touch screen and an LCD, respectively. It goes without saying thatthey can be independently composed of hard keys (a keyboard) or apointing device like a mouse pointer as the input section 204, and a CRTas the display section 207 and the like.

The operation screen 501 of FIG. 8 is an example of displaying adocument stored in the storage section 202 after performing the “objectdata and metadata generating processing” described above on the scandata obtained by scanning the paper original document 401 with the MFP100. The operation screen 501 can display the state in which the scandata is divided into a plurality of object data with the attribute ofone of IMAGE, TEXT and GRAPHIC as a thumbnail image 502. In addition,each object can be selected freely by touching the object in thethumbnail image 502.

Here, if the user presses the object change button 504, the screen makesthe transition. After selecting the Image Object 1 (801) in thethumbnail image 502 of the screen after the transition (refer to themiddle of FIG. 8), the user assigns any desired value to a divisionlevel 802. Then, when the user presses an object division button 803,the division of the object selected becomes possible at the specifieddivision level (division number).

In the present embodiment, the division level indicates the number ofobjects after the object division. Thus, if it is set at 2, the object801 is divided into two parts as denoted by the reference numeral 804.As for a dividing line indicating the position for dividing the object,it is assumed to be determined at a desired boundary of the parts byanalyzing the histograms in horizontal and vertical directions of eachobject, and/or by analyzing continuity of pixels of the same color (suchas concatenated black pixels). In addition, when carrying out thedivision processing, the MFP 100 sets (registers) the keyword that hasbeen assigned to the object (801) before the division as the initialkeyword of each object after the division. After that, if the userrequires changing the keyword assigned to the object after the division,the user can change the keyword by pressing the keyword change button503. Thus, if the objects have been formed in units greater than theuser desires, the user can divide the object into a plurality of objectsas he or she desires. The function can further improve the reusabilityof the object data.

Other Embodiments

The present invention is further applicable not only to a systemconsisting of a plurality of devices (such as a computer, an interfaceunit, a reader and a printer), but also to an apparatus consisting of asingle device (such as a multifunction machine, a printer or a faxmachine).

In addition, the object of the present invention can be achieved byreading a program code for implementing the procedures of the flowchartsshown in the embodiments described above from a storage medium in whichthe program code is stored, and by executing the program code with acomputer (or CPU or MPU) of the system or apparatus. In this case, theprogram code itself read from the storage medium implements thefunctions of the foregoing embodiments. Accordingly, the program code ora computer readable storage medium that stores/records the program codeconstitutes the present invention as well.

As the storage medium for supplying the program code, a floppy(registered trademark) disk, hard disk, optical disk, magneto-opticaldisk, CD-ROM, CD-R, magnetic tape, nonvolatile memory card, ROM and thelike can be used.

The functions of the foregoing embodiments are implemented by executingthe read-out program with the computer. In addition, the term “executionof the program” includes the case where an OS and the like working onthe computer perform part or all of the actual processing according tothe instructions of the program.

Furthermore, the functions of the foregoing embodiments can also beimplemented by a function expansion board inserted into a computer or afunction expansion unit connected to the computer. In this case, theprogram read out of the storage medium is written into a memory in theexpansion board inserted to the computer or into a memory in theexpansion unit connected to the computer. After that, according to theinstructions of the program, the CPU in the expansion board or in theexpansion unit executes part or all of the actual processing. Suchprocessing by the function expansion board or function expansion unitcan also implement the functions of the foregoing embodiments.

While the present invention has been described with reference toexemplary embodiments, it is to be understood that the invention is notlimited to the disclosed exemplary embodiments. The scope of thefollowing claims is to be accorded the broadest interpretation so as toencompass all such modifications and equivalent structures andfunctions.

This application claims the benefit of Japanese Patent Application No.2007-197733, filed Jul. 30, 2007, which is hereby incorporated byreference herein in its entirety.

1. An image processing apparatus comprising: means for extractingobjects included in an original document; obtaining means for obtaininga keyword for each of the extracted objects by performing at least oneof image analysis and document analysis for each of the extractedobjects; means for storing object data of each of the extracted objectswith the keyword that is obtained for each of the extracted objects bythe obtaining means in a storage device; visualizing means forvisualizing the objects included in the original document based on thestored object data on a display unit; selecting means for selecting atleast two objects from the visualized objects in accordance with a userinstruction; keyword-displaying means for simultaneously displaying thekeywords stored with the at least two objects selected by said selectingmeans, wherein the keyword-displaying means does not display thekeywords stored with objects that are not selected by said selectingmeans; and keyword assigning means for changing the displayed keywordsinto inputted keywords based on a user instruction, thereby collectivelychanging the keywords being stored with the object data of the selectedat least two objects into the inputted keywords.
 2. The image processingapparatus as claimed in claim 1, further comprising integration meansfor integrating the plurality of object data corresponding to the atleast two objects selected by said selecting means into one object data,wherein said keyword assigning means assigns, when said integrationmeans carries out integration, a keyword to the object data after theintegration.
 3. The image processing apparatus as claimed in claim 1,wherein said selecting means selects one of the visualized objects inaccordance with the user instruction; said image processing apparatusfurther comprises division means for dividing the one object selected bysaid selecting means into a plurality of objects, the keyword that hasbeen assigned to the selected one object being set as the initialkeyword of each object after the division, wherein said keywordassigning means assigns, when the division means carries out thedivision, an inputted keyword to each of the divided objects.
 4. Theimage processing apparatus as claimed in claim 1, further comprisingmeans for searching for the object data in accordance with the keywordassociated with the object data.
 5. The image processing apparatus asclaimed in claim 1, wherein said selecting means selects the at leasttwo objects from a plurality of pages in accordance with the userinstruction.
 6. The image processing apparatus as claimed in claim 1,further comprising communication means for transmitting the object dataand keyword to a storage device of an external server to be stored. 7.The image processing apparatus as claimed in claim 3, further comprisingmeans for designating a number of division at a time of dividing theobject data by said division means.
 8. A control method for an imageprocessing apparatus, comprising: a step of extracting objects includedin an original document a step of obtaining a keyword for each of theextracted objects by performing at least one of image analysis anddocument analysis for each of the extracted objects; a step of storingobject data of each of the extracted objects with the keyword that isobtained for each of the extracted objects by the step of obtaining in astorage device; a step of visualizing the objects included in theoriginal document based on the stored object data stored in said storagedevice on a display unit; a selecting step of selecting at least twoobjects from the visualized objects in accordance with a userinstruction via a GUI on said display unit; a step of simultaneouslydisplaying on said display unit the keywords stored with the at leasttwo objects selected by the step of selecting, wherein the step ofdisplaying does not display the keywords stored with objects that arenot selected by the step of selecting; and a keyword assigning step ofchanging the displayed keywords into inputted key words based on a userinstruction, thereby collectively changing the keywords being storedwith the object data of the selected at least two objects into theinputted keywords.
 9. A computer readable storage medium thatretrievably stores a program for causing a computer for controlling animage processing apparatus to execute: a step of extracting objectsincluded in an original document a step of obtaining a keyword for eachof the extracted objects by performing at least one of image analysisand document analysis for each of the extracted objects; a step ofstoring object data of each of the extracted objects with the keywordthat is obtained for each of the extracted objects by the step ofobtaining in a storage device; a step of visualizing the objectsincluded in the original document based on the stored object data storedin said storage device on a display unit; a selecting step of selectingat least two objects from the visualized objects in accordance with auser instruction via a GUI on said display unit; a step ofsimultaneously displaying on said display unit the keywords stored withthe at least two objects selected by the step of selecting, wherein thestep of displaying does not display the keywords stored with objectsthat are not selected by the step of selecting; and a keyword assigningstep of changing the displayed keywords into inputted key words based ona user instruction, thereby collectively changing the keywords beingstored with the object data of the selected at least two objects intothe inputted keywords.